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Abstract 

To sense numerous odorants and chemicals, animals have evolved a large number of olfactory receptor genes (Olfrs) in their 
genome. In particular, the house mouse has —1,100 genes in the Olfr gene family. This makes the mouse a good model 
organism to study Olfr genes and olfaction-related genes. To date, whether male and female mice possess the same ability in 
detecting environmental odorants is still unknown. Using the next generation sequencing technology (paired-end mRNA- 
seq), we detected 1,088 expressed Olfr genes in both male and female olfactory epithelium. We found that not only Olfr 
genes but also odorant-binding protein (Obp) genes have evolved rapidly in the mouse lineage. Interestingly, Olfr genes tend 
to express at a higher level in males than in females, whereas the Obp genes clustered on the X chromosome show the 
opposite trend. These observations may imply a more efficient odorant-transporting system in females, whereas a more 
active Olfr gene expressing system in males. In addition, we detected the expression of two genes encoding major urinary 
proteins, which have been proposed to bind and transport pheromones or act as pheromones in mouse urine. This 
observation suggests a role of main olfactory system (MOS) in pheromone detection, contrary to the view that only accessory 
olfactory system (AOS) is involved in pheromone detection. This study suggests the sexual differences in detecting 
environmental odorants in MOS and demonstrates that mRNA-seq provides a powerful tool for detecting genes with low 
expression levels and with high sequence similarities. 

Key words: mRNA-seq, olfactory epithelium, olfactory receptor, odorant-binding protein, major urinary protein, sexual 
differentiation. 



Introduction 

There are two olfactory systems in vertebrates: the main 
olfactory system (MOS) and the accessory olfactory system 
(AOS). MOS is composed of the main olfactory bulb and 
the olfactory epithelium (OE), while AOS is composed of 
the accessory olfactory bulb and the vomeronasal organ 
(VNO). OE is the main tissue for the expression of olfactory 
receptors (ORs), while VNO is the tissue for the expression 
of the vomeronasal receptors (VNRs). The detection of 



odorants and chemicals by ORs or by VNRs is transmitted 
to the brain by the main olfactory bulb or the accessory 
olfactory bulb, respectively. It is commonly thought that 
in mammals, ORs are responsible for recognizing environ- 
mental volatile odorants, such as food odors, whereas 
VNRs are responsible for detecting chemical cues related 
to social behaviors, such as pheromones (Touhara 
2007). In this study, we examine whether there are sexual 
differences in the expression profiles of genes expressed 
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in OE, particularly those involved in the odorant detection 
in mammals. 

In the odorant perception process, hydrophobic mole- 
cules are transported through the nasal mucus by odorant- 
binding proteins (OBPs) to ORs. OBPs belong to the lipocalin 
protein family, which are extracellular transport proteins for 
carrying small hydrophobic molecules in aqueous solutions 
(see the review by Godfrey et al. 2004). They can be divided 
into subclasses based on the tissue specificity and the func- 
tion in mammals such as OBPs and major urinary proteins 
(MUPs) (Flower et al. 2000). In mouse, three Obp genes 
(Obpla, Obplb, and Obp2) have been identified and all 
of them are located on the X chromosome (Bocskei et al. 
1992; Briand, Blon, et al. 2004; Briand, Trotier, et al. 
2004; Meslin et al. 201 1). Obpla and Obplb form hetero- 
dimers to carry odorant molecules (Pes et al. 1998; Utsumi 
etal. 1999). In rat, Obp1 , Obplf, Obp2a, Obp2b, and Obp3 
were identified (Briand et al. 2000). However, only Obp1 
and ObpM are located in the syntenic region of mouse 
Obp genes on the X chromosome. 

The OR genes (O/frs) form one of the largest gene fam- 
ilies, composing of hundreds of copies in the mammalian 
genome. The existence of a large number of Olfr genes 
in a genome may confer the animal the ability to detect nu- 
merous kinds of odorants. It appears that the house mouse 
(Mus musculus), a nocturnal mammal, relies heavily on the 
sense of smell to perceive the environmental cues as it pos- 
sesses — 1,000 functional Olfr genes. The mouse Olfr genes 
have been classified into at least 241 subfamilies and are 
mapped to 51 loci distributed on 17 chromosomes with 
the largest cluster on chromosome 2 (Tegoni et al. 2000). 

Sex-specific behaviors due to sex differences in responses 
to stimuli, such as colors, courtship songs, and chemosen- 
sory cues, have been found in animals (Godfrey et al. 2004). 
The chemosensory cues in animals include pheromone mol- 
ecules, which are detected specifically by VNRs in mouse 
(Rubenstein and Lovette 2009). Sex-specific behaviors can 
be initiated by sex-specific pheromones (Kurtovic et al. 
2007; Wyart et al. 2007; Haga et al. 2010) or by sex- 
differential expression of VNRs in response to the same pher- 
omones (Haga etal. 2010; Roberts etal. 2010). For example, 
a male-specific pheromone, Darcin (the MUP 20, encoded 
by Mup20), in the mouse urine only attracts females and 
stimulates their memory (Herrada and Dulac 1997). As an- 
other example, both male and female mice detect exocrine- 
secreted peptide 1 (ESP1), but only female-specific mating 
behaviors are stimulated (Roberts et al. 2010), indicating the 
existence of gender-specific neuronal circuits. In contrast, 
no evidence of sexual differences in MOS has yet been 
reported. Thus, it remains unclear whether males and 
females have the same ability in sensing odorants. 

To address the question of sexual differentiation in MOS, 
an approach with a high resolution and great sensitivity in es- 
timating mRNA levels is required. Previous authors have used 



oligonucleotide microarrays and single-read mRNA-seq to 
study the expression of Olfr genes in OE and in matured ol- 
factory sensory neurons (OSNs), respectively (Haga et al. 
2010; Magklara et al. 2011). However, the hybridization- 
based gene expression profiling methods are not ideal for 
the studies of Olfr genes for two reasons: First, a series of re- 
cent expansions of the Olfr gene family have produced many 
Olfr gene pairs with a very high sequence similarity. Second, 
Olfr genes are expressed in a "one neuron one receptor" man- 
ner (Zhang et al. 2004), leading to extremely low gene expres- 
sion levels per unit tissue weight. For tissue-specific genes 
involving high possibilities of cross-hybridization between 
probes on arrays (Serizawa et al. 2004), a sequencing ap- 
proach is more suitable for specific quantifications. In addi- 
tion, the expression profiles of matured OSNs, where Olfr 
genes are expressed predominantly, do not show the com- 
plete expression of Obp genes. Hence, we used the lllumina 
paired-end mRNA-seq to obtain the transcriptomes of OE in 
both sexes, which are suitable for studying the expression 
patterns of Olfr genes and Obp genes. 



Materials and Methods 

Mice Collection and Total RNA Isolation 

All the animals used in this study were processed following 
the approved protocol of Institutional Animal Care and Use 
Committee at the National Laboratory Animal Center 
(NLAC), Taipei, Taiwan. 

Pups from brother-sister matings of the BALB/cByJNarl 
(BALB/c, a common inbred strain), from NLAC were used 
for this study. To minimize environmental variations be- 
tween the two sexes, we sacrificed mature pups at 4 weeks 
of age and isolated their OE tissue before male and female 
individuals were separated. Only litters with at least three 
males or three females were selected. The nasal tissue 
was isolated and preserved in RNALater solution (Ambion) 
immediately after sacrification. The tissue was allowed for 
penetration by RNALater solution at 4 °C overnight and then 
transferred to -20 °C before further isolation of total RNA. 
OE was dissected from the nasal tissue followed by total 
RNA isolation, using RNEasy Plus Mini Kit (Qiagen) with 
an additional on-column DNase treatment recommended 
by the manufacture (Qiagen). The 1 5 min DNase treatment 
was carried out at room temperature by mixing 1 0 ul DNase 
and 70 ul RDD buffer and applied to the RNA binding col- 
umn after the first wash. The RNA quantities and qualities of 
each individual were analyzed by Nanodrop and BioAnalyzer 
II (Agilent). If all three samples from the same litter passed 
the quality control (RNA integrity number [RIN] > 8.0), 1 0 ug 
of total RNA from each sample would be pooled to reach 
final of 30 ug total RNA for sequencing for each sex. 

RNA samples from the OE of C57BI76JNarl (B6, from 
NLAC, Taiwan) were used to confirm the expression of 



704 Genome Biol. Evol. 4(5):703-712. doi:10.1093/gbe/evs039 Advance Access publication April 17, 2012 



Sexual Differences in Gene Expression in the Olfactory Epithelium in Mouse 



GBE 



MUP genes. We collected tissues of livers, spleens, kidneys, 
testes, ovaries, hearts, and lungs for RNA isolation to con- 
firm the expression of Mup genes. Total RNA was isolated 
with the same approach described above. 

Paired-End mRNA-seq with the lllumina Genome 
Analyzer llx 

For pair-end mRNA-seq library preparation, we used lllumina 
mRNA-seq kits. A total of 30 ug total RNA was used for 
mRNA enrichment by oligo-dT beads followed by cation- 
catalyzed fragmentation for 4 min at 94 °C. The mRNA frag- 
ments were then converted into double stranded cDNA by 
random priming followed by end repair. The fragmented 
cDNAs were then ligated to the paired-end adaptors and 
subjected to size selection. For each pooled RNA sample, 
three sizes of —400 bp, —500 bp, and —550 bp were se- 
lected for the ligated cDNA. The three-gel purified cDNA li- 
braries were then subjected to 1 5 cycles of polymerase chain 
reaction (PCR) amplification and purified by Ampure beads 
(Beckman Agencourt). The absolute concentrations of the 
libraries were determined by Qubit fluorometry (Invitrogen) 
and BioAnalyzer High Sensitivity DNA Kit (Agilent). Each size 
selected mRNA-seq library was loaded in one lane of flow 
cell and paired-end 2 x 120 nt sequencing was conducted 
on lllumina Genome Analyzer llx, totaling three lanes of data 
per pooled transcriptome. Library preparation and lllumina 
sequencing was conducted by High Throughput Sequencing 
Core Facility, Biodiversity Research Center, Academia Sinica, 
Taiwan. 

Analysis of Paired-End Sequences 

We trimmed all the paired-end sequencing reads from both 
ends of each cDNA fragment to 90 bp to reduce sequencing 
errors. Paired-end sequencing reads were mapped to the 
genome with TopHat (version 1 .2.0) (Qian et al. 201 0; Xiong 
etal. 2010; Chang and Liao 201 1), allowing a 100-bp stan- 
dard deviation for a mean inner distance of 370 bp between 
paired reads from both ends. Only those pair-end reads 
mapped to the genome without mismatch were used for 
subsequent analyses. We first categorized mappable frag- 
ments into two groups: "unique" fragments, each of which 
was mapped to a single position in the genome, and 
"multiple-hit" fragments, each of which was mapped to 
more than one position in the genome. To calculate the ex- 
pression levels, unique fragments were assigned to individ- 
ual gene first for initial abundance estimation, and the 
multiple-hit fragments were then redistributed to those 
genes based on the relative abundances of uniquely mapped 
fragments. The total mappable fragments increased 2% by 
including redistributed multiple-hit fragments. The normal- 
ized expression levels of genes, measured in fragments per 
kilobase of exon per million fragments mapped (FPKMs), 
were calculated using Cufflinks (version 1.0.3) (Trapnell 



et al. 2009). Total mappable fragments on each chromosome 
were calculated by SAMtools (Trapnell et al. 2010). 

The mouse reference genome was from one of the com- 
mon lab inbred strains, C57BI76 (B6), while our transcrip- 
tomes were generated from another inbred strain, 
BALB/c. Single-nucleotide polymorphic sites between the 
two strains might lead to biased estimation of the expression 
level of transcripts. We therefore used a data set of 72-bp 
sequences generated by single-end mRNA-seq to confirm 
that there was no bias in estimation of expression levels 
when we mapped BALB/c cDNA sequences to the B6 
genome (supplementary data, including table S6, Supple- 
mentary Material online). 

Reverse Transcription PCR of Mup 
Genes and Quantitative Reverse 
Transcription PCR of Obp Genes 

A total of 2 ug RNA of each sample was reverse transcribed 
with MultiScribe Reverse Transcriptase (Life Technologies) in- 
to cDNA for both Reverse Transcription PCR (RT-PCR) and 
Quantitative Reverse Transcription PCR (qRT-PCR) reactions. 
Total RNA was incubated with RT enzymes at 25 °C for 10 
min prior to the RT reaction. RT reactions were performed at 
37 °C for 2 h followed by the inactivation of RTenzyme at 85 
°C for 10 s. For RT-PCR, 1 \i\ of 10x diluted cDNA was am- 
plified by Fast Start Taq DNA polymerase (Roche) in a total of 
1 0 ul reaction. For qRT-PCR, 1 of 1 0x diluted cDNA prod- 
ucts was quantified with 2x SYBR Green Master Mix (Kapa 
Biosystems) in a total of 10 ul reaction and performed on 
LightCycler 480 (Roche). 

The primers used in this study are as follows: 
Actb, 5'-CCA GTT CGC CAT GGATGA CGATAT-3' and 
5'GTC AGG ATA CCT CTC TTG CTC TG-3'. Omp, 5'-GAA 
GCA GGA TGG TGA GAA GC-3' and 5'-CGT GTC ATG 
AGG TTG GTG AG-3'. Mup4, 5'-AGA AGG ACG TGG 
TCC TGA CA-3' and 5'-TAA GTT CTG TCC CTT GGA 
AG-3'. Mup5, F: 5'-GAA AGA CCT GGT ACT GAG AG-3' 
and 5'-CTA GCT TCT TCT GCA TGG AC-3'. Gm14743, 
F: 5'-GGCATTCCAGCTGGAAACCTTAA-3' and 5'-CTTATG 
CTGTATCCTCACTT-3 ' . Obplb, F: 5'-GGCATTCCAGCTG- 
GAAACCTTAG-3' and 5 ' -CTTATGCTGTATC ATC ACTG-3 ' . 
Gm5938, 5 ' -ATATGCTGTGTC AAGCC AC A-3 ' and 5'-TAA 
CAGGTCGTAGATCATGAG-3'. Obpla, 5'-AAGGGAATTC 
CAGCTGGAAA-3' and 5'-GGAAGATCATGAGAAGGG- 
GAA-3'. 5430402E10Rik, 5 ' -G GTG AAGTTC CTG CTAATTG 
TGA-3' and 5'-CATCTGGACATGGAATTTGAC-3'. Obp2 
(Gm14744), 5'-GGTGAAGTTCCTGCTAATTGCGC-3' and 
R: 5 ' -C ATCTGG AC ATGG AATTTGAT-3 ' . 

Functional Enrichment and Evolutionary Analyses 

We looked for functionally enriched gene ontology (GO) 
terms in sex-biased genes against the rest of expressed 
genes (FPKMs > 0.05) in OE, using FatiGO (version 3.2) 
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(Li et al. 2009). The significance of differential expression 
between the two sexes of a given gene was tested by 
the f-test, using the FPKMs obtained from three cDNA librar- 
ies of each sex. 

The protein sequence of rat OBP1 was retrieved from the 
Ensembl database (ENSRNOP000000491 61 ), and that of rat 
OBP1f was obtained from Briand et al. (Al-Shahrour et al. 
2004). The protein sequences, including the hypothetical pro- 
teins, of mouse OBPs were retrieved from Ensembl database. 
Multiple sequence alignment was conducted by ClustalW2 
(http://www.ebi.ac.uk/Tools/msa/clustalw2/) and a neighbor- 
joining tree was constructed using MEGA5 (Tamura et al. 
2011). 



Results 

Transcriptome Data 

We used lllumina GAIIx to obtain the OE transcriptomes from 
one male sample and one female sample separately. Each 
sample was a pool of mRNA from three littermates of one 
gender. Three cDNA libraries with insert lengths ranging from 
400 to 550 bp were constructed for each sample. In total, we 
obtained —168 million and —203 million paired 120-bp reads 
in the male OE and in the female OE, respectively. From these 
reads, we obtained 1 38.9 million mappable-paired fragments 
for the male OE and 1 63.6 million for the female OE, corre- 
sponding to 83% and 81% mappable rates, respectively. 
Overall, we detected -21,000 genes with FPKM > 0.05 
(supplementary table S1, Supplementary Material online). 
In general, the transcriptomes of the male OE and the female 
OE were highly correlated (Pearson correlation coefficient, 
r = 0.94, P< 10" 15 ). 

Male-Biased Expression of Olfr Genes in Mice 

Among the 1 , 1 96 Olfr genes in the mouse genome, we de- 
tected 1,088 expressed Olfr genes in both male and female 
OEs (supplementary table S2, Supplementary Material on- 
line). Almost all Olfr genes were found to be very lowly ex- 
pressed in the transcriptomes (FPKM < 1 5). The expression 
levels of Olfr genes were highly correlated between the two 
sexes (r = 0.96, P< 10~ 15 ). Overall, 12 Olfr genes have no 
detectable expression in the female OE and six have no de- 
tectable expression in the male OE. However, the highest 
expression level of these genes is only 1 .82 FPKM and most 
of them are lower than 0.05 FPKMs (supplementary table 
S2, Supplementary Material online). Therefore, these 
"sex-specific" Olfr genes may not contribute to sex differen- 
ces in function. In addition to functional Olfr genes, we de- 
tected the expression of 57 of the 114 Olfr pseudogenes 
annotated in the mouse genome. 

To address the question of whether there are differences 
in Olfr expression levels between the two sexes, we com- 
pared the ranked expression levels of Olfr genes in the 
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Fig. 1. — Comparison of ranked expression levels of Olfr genes in 
the OE transcriptomes of male and female mice. The values of upper 
quartile, median, and lower quartile are indicated in each box, and the 
bars outside the box indicate semiquartile ranges. Male Olfr genes tend 
to express at a higher level (Mann-Whitney U test, P < 10~ 4 ). 



two OE transcriptomes. The medians of ranked expression 
levels of Olfr genes are around 20,000 among all 25,000 
genes in both sexes (fig. 1). We made the following inter- 
esting observations. First, Olfr genes tend to be expressed 
at a higher level in males than in females (fig. 1, P < 10~ 4 ). 
Second, all of the 254 Olfr genes that showed sexual pref- 
erence were expressed at a higher level in the male OE (sup- 
plementary table S3, Supplementary Material online). Third, 
the proportion of male-biased Olfr genes (254/1 ,088) is sig- 
nificantly higher than the proportion of the male-biased 
genes in the OE transcriptome (3,884/19,934) (Chi-square 
= 5.49, P = 0.02). Fourth, although chromosome 2 has 
the largest Olfr gene cluster, chromosome 7 showed the 
highest proportion of male-biased Olfr genes (86/228, sup- 
plementary table S3, Supplementary Material online). In 
conclusion, Olfr genes tend to be more strongly expressed 
in the male OE than in the female OE. 

We further confirmed the sexually biased expression of 
Olfr genes by examining gene enrichments in GO categories 
(table 1 ). We identified a total of 4,300 genes with a signif- 
icant expression difference between the two sexes. The en- 
richment analysis shows that these genes involve in 
biological processes, molecular function, and cellular com- 
ponents (supplementary table S5, Supplementary Material 
online), particularly in sensory perception of smell, trans- 
membrane receptor activity, and G-protein-coupled recep- 
tor activity categories (table 1). Almost 60% of genes in the 
categories of sensory perception of smell (GO:0007608), 
transmembrane receptors (GO:0004888) and G-protein- 
coupled receptors (GO:0004930) show different expression 
levels between the two sexes. As ORs are G-protein coupled 
receptors with seven transmembrane domains, the results 
confirm the differential expression of Olfr genes. 
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Table 1 

GO Categories Enriched with Sexually Biased Genes 







Sex-Biased 




Functional Categories 


GO 


Genes (%) 


P 


Sensory perception of smell 


0007608 


59.17 


0.00 


Transmembrane receptor activity 


0004888 


55.87 


0.00 


G-protein-coupled receptor 


0004930 


55.22 


0.00 



activity 

Note. — The proportion of sex-biased genes in each category is shown in 
percentage (%). For the complete list of GO categories enriched or depleted with 
sex-biased genes, please see supplementary table S5 (Supplementary Material online). 



Female-Biased Expression of X-Linked Obp Genes 

Comparing the paired-end fragments mappable to each 
chromosome, we found that the X chromosome had 2.8 
times more fragments in the female OE than in the male 



OE (supplementary table S4, Supplementary Material on- 
line). A comparison of the expression levels of individual 
genes on each chromosome revealed that the higher expres- 
sion of the X chromosome in the female OE was due to five 
Obp genes expressed abundantly in OE (fig. 2). Removing the 
FPKMs of these five genes from the transcriptomes resulted 
in a nonbiased expression pattern of the X-linked genes. 

The five genes are located in a 0.7-Mb region of the X 
chromosome. Two of them encode subunits of OBP1, 
Obpla and Obplb; the other three genes {Gm 14743, 
Gm14744, and 5430402 EWRik) have unknown functions. 
The predicted protein sequence of Gm 14744 was identical 
to the OBP2 identified previously, so we denoted this gene 
as Obp2. We further found that the Gm5938 gene is located 
within the 0.7-Mb region and shows sequence homology to 
Obpla; however, it has very low expression level compared 
with the other five Obp genes. As the 0.7-Mb region is 



1,000,000.00 



Obps, Lens and Mup4 



0 01 




1,000,000.00 



0.01 J 

Fig. 2. — Expression levels of each gene on autosomes and on chromosome X between the two sexes. The FPKM values were logarithmically 
transformed. The filled dots represent the genes on the X chromosome, whereas the opened dots represent the genes on autosomes. Only genes with 
detectable expression levels (FPKMs > 0.05) are shown. We found that the higher female mappable fragments of X-linked genes were due to five 
X-linked Obp genes expressed significantly higher in the female OE (grouped in the red color oval-shaped circle). Highly female-biased genes also 
included autosomal lipocalin protein genes (Lcnll, Lcn13, and Lcn14) and Mup4. The arrow indicates a highly expressed female-biased pseudogene 
(ENSMUSG00000082635) located in the X-linked Obp gene cluster. 
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Table 2 

Female-Biased Expression Patterns of Genes in the X-Linked Obp Gene Cluster 



Gene Name 


Chromosome Location 


Chromosome Strand 


Male (FPKM; 


Female (FPKM) 


M/r KatlOS 


P 


0.7-Mb region 














Obp la 


75,330,843-75,336,71 3 




27,620.10 


69,529.40 


0.40 




Obp lb 


75,432,115-75,437,631 




23,272.30 


75,798.30 


0.31 




Gm 14743 


75,485,800-75,491,260 




15,287.80 


34,579.90 


0.44 


* 


Gm5938 


75,370,805-75,375,742 




140.12 


348.02 


0.40 




Obp2 (Gm 14744) 


75,1 10,096-75,1 15,372 




14,899.80 


49,584.20 


0.30 


* * 


5430402E10Rik 


75,233,629-75,238,906 




34,157.80 


80,668.10 


0.42 


* * 


Flanking genes 














Prkx 


75,006,749-75,041,617 




27.75 


14.51 


1.91 


* 


Pbsn 


75,083,239-75,098,962 




0.21 


0.40 


0.52 


ns 


Prrgl 


75,715,282-75,829,194 




1.95 


1.48 


1.32 


ns 


ENSMUSG00000071735 


76,511,897-76,564,418 




1.97 


1.43 


1.38 


ns 



Note.— The significance of male (M)-to-female (F) FPKM ratios (M/F ratios) is shown as **P < 0.01, *P < 0.05, and ns: P > 0.05. 



enriched with Obp genes, we name it the "X-linked Obp 
gene cluster." 

All the genes in the X-linked Obp gene cluster were highly 
expressed in both sexes and showed a female-biased expres- 
sion pattern (table 2). We confirmed the female-biased ex- 
pression patterns of all the six genes in at least three mice by 
qRT-PCR assay (fig. 3). The nonbiased expression of Gm5938 
based on qRT-PCR may be because its expression level is very 
low, so that it is difficult to detect a difference by qRT-PCR. 
No female-biased expression was found in the genes located 
in the flanking regions of this 0.7-Mb region. In particular, 
probasin (Pbsn), which was proposed to be a member of the 
Obp gene family with a prostate-specific expression pattern 
(Utsumi et al. 1999), is located 27-Kb away from the gene 
cluster (table 2). We concluded that the female-biased ex- 
pression pattern is specific to the X-linked Obp gene cluster. 



Rapid Evolution of Mouse Obp Genes 

We found that Obp genes clustered on the X chromosome 
evolved rapidly, while maintaining multiple functional dupli- 
cate copies in the mouse genome. A multiple sequence 
alignment confirmed that the hypothetical proteins 
Gm5938, Gm14743, and 5430402E10Rik share sequence 
similarities with OBP1a, OBP1b, and OBP2, respectively 
(supplementary fig. S1/\, Supplementary Material online). 
A phylogenetic tree constructed from the protein sequences 
of the OBPs and MUPs expressed in OE revealed that at least 
four duplication events have occurred in the Obp gene clus- 
ter (fig. 4). One of the duplications resulted in the genes for 
the two Obp1 subunits, Obpla and Obplb. The other three 
duplications occurred independently in the mouse lineage, 
leading to the existence of three pairs of paralogous Obp 
genes: 1) Obpla and Gm5938, 2) Obplb and Gm 14743, 



Female 




Fig. 3. — qRT-PCR confirmed the female-biased expression of X-lined Obp genes of BALB/c mice at 4 weeks of age. The figure shows the 
expression levels of each X-linked Obp genes and olfactory major protein gene (Omp, a marker gene of matured OSNs) estimated by qRT-PCRs in the 
two sexes. At least three mice were used in the qRT-PCR assay. The error bars show the standard errors of gene expression levels among all the OE 
samples tested. The expression levels are significantly different between the two sexes: **P < 0.01, *P < 0.05; the expression levels are not different 
between the two sexes: ns, P > 0.05. 
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Fig. 4. — Phylogenetic tree constructed with protein sequences of OBPs and MUPs in mouse and rat. The maximum likelihood method was used with 
500 bootstrap replications. The bootstrap values are shown on the figure. The arrows indicate duplication events that occurred within the mouse genome. 



and 3) Obp2 and 5430402E10Rik. The branch lengths indi- 
cate that the paralog pairs, Obp1b/Gm14743 and Obp2/ 
5430402E10Rik, originated more recently than Obpla/ 
Gm5938. Only one and two amino acid differences were 
found between Obp1b/Gm14743 and between Obp2l 
5430402E10Rik, respectively. 

In rat, only Obp1 and Ofop7fwere identified in the syntenic 
region of the X-linked Obp gene cluster. The syntenic region is 
flanked by Pbsn and Proline rich Gla (G-carboxyglutamic acid) 

1 {Prrgl) in both rat and mouse (supplementary fig. SIB, 
Supplementary Material online). However, only one duplica- 
tion event was observed in the syntenic region on the X chro- 
mosome of rat, which resulted in Obp1 and Obplf. Thus, we 
concluded that the rapid evolution of Obp gene cluster was 
specific to the mouse lineage. 

The three novel genes {Gm14743, Gm5938, and 
5430402E10Rik) also maintained female-biased expression 
patterns based on the NGS data, as did their paralogs. This is 
likely due to the conservation of the upstream regulatory 
sequences. The upstream sequences of Obplb and 
Gm14743 are conserved up to 5 kb (the number of nucle- 
otide differences per site, n = 0.08), and the upstream se- 
quences of Obp2 and 5430402E1 ORik are conserved up to 
6 kb (71 = 0.10). However, the upstream region of Gm5938 
has many insertions and deletions and shows n = 0.43 for 
about 4 kb compared with that of Obpla. The loss of con- 
servation in the regulatory region might explain the rela- 
tively low expression of Gm5938 in OE. We also detected 
high expression levels of three Lcn genes, Lcn11, Lcn13, 
and Lcn14. The three genes are located on chromosome 

2 within a cluster of Lcn genes. The abundant expression 
was only detected in the three genes but not in the others 
(fig. 2). Interestingly, the three genes also showed a signifi- 
cant higher expression level in the female OE. Although the 
function of these genes has not been reported in the olfac- 



tory systems, Lcn14 was proposed to be an ortholog of 
OBP2A in human (Mouse Genome Informatics, The Jackson 
Laboratory). It is possible that there are more proteins that 
function as OBPs in MOS in order to increase the diversity in 
transporting various kinds of odorants. 

Expression of Mup Genes in OE 

Among Mup genes, including 21 Mup functional genes and 
the 21 Mup pseudogenes, we detected the expression of 
Mup4 and Mup5 in OE but not other Mup genes. These 
two genes are expressed abundantly; they are among the 
50 highest expressed genes in OE (fig. 2 and supplementary 
table S1, Supplementary Material online). The FPKMs of 
Mup4 are 1 0,720.6 and 2441 9.60 in males and in females, 
respectively. The FPKMs of Mup5 are 6591 .54 and 6840.84 
in males and in females, respectively. As MUPs were pro- 
posed to carry and transport pheromones in urine and/or 
act as pheromones themselves (Johnson et al. 2000), our 
result implies a possible role of MOS in pheromone percep- 
tion. We examined the expression patterns of the two genes 
in various types of tissue from two laboratory-inbred strains, 
BALB/c and B6, by RT-PCR. We found that Mup4 and Mup5 
were mainly expressed in the OE of both sexes in both strains 
(fig. 5). In addition, the expression patterns of Mup4 and 
Mup5 were identical in juvenile (4 weeks old) and adult 
(6 weeks old) mice (data not shown). Thus, the expression 
patterns of Mup4 and Mup5 in OE are not sex, age, or strain 
specific. 

Discussion 

To obtain a deep coverage of lowly expressed genes, partic- 
ularly Olfr genes, three cDNA libraries with different insert 
sizes were constructed for each RNA sample. We found that 
the expression levels of a given gene estimated from the 
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Fig. 5. — RT-PCR confirmed the expression of Mup4 and Mup5 in 
OE. Expression of Omp, Mup4, and Mup5, in various tissues of the two 
inbred strains. The /?-actin gene (Actb) was used as the positive control. 
OE_M indicates the male OE, and OE_F indicates the female OE. The 
expression patterns of Mup4 and Mup5 suggest that these two genes 
are mainly expressed in OE. 

three cDNA libraries were highly correlated (supplementary 
fig. S2, Supplementary Material online). Furthermore, the 
ranked expression levels of Olfr genes estimated from three 
cDNA libraries were not significantly different, except for the 
library with the shortest fragment sizes in the male OE (sup- 
plementary fig. S3, Supplementary Material online). Toung 
et al. (Beynon and Hurst 2003; Roberts et al. 2010) sug- 
gested that a minimum of 500 million reads is required 
to estimate FPKMs accurately for a human B cell transcrip- 
tome. In this study, we found no significant difference be- 
tween the FPKMs of Olfr genes estimated from a single 
cDNA library and those estimated from the pool of three 
libraries (supplementary fig. S4, Supplementary Material 
online). Therefore, it seems that a sample of —80 million 
120-bp paired-end reads can provide unbiased estimation 
of gene expression levels for the mouse OE transcriptome. 

Previous studies have used array-based and sequencing- 
based approaches to estimate the expression levels of Olfr 
genes in OE or matured OSNs (Toung et al. 201 1). Conduct- 
ing single-end mRNA-seq, Magklara et al. (2011) showed 
that the expression of Olfr genes ranged from 0 to 46 
RPKMs in matured OSNs (only 19 O/frs had RPKMs > 15, 
whereas 717 Olfrs had RPKMs >1). Our study shows similar 
results in the expression levels of Olfr genes with Magklara 
et al. (201 1), that is, Olfr genes are lowly expressed, consis- 
tent with the "one neuron one receptor" hypothesis 

Overall, our study has detected more expressed Olfr 
genes than the previous studies and has identified Olfr 
genes and novel duplicate Obp genes that are expressed dif- 
ferentially between the two sexes. Thus, mRNA-seq, partic- 
ularly paired-end technology, provides a high resolution in 



estimating expression levels of genes from highly duplicated 
gene families, such as Olfr and Obp genes. 

Sexual Differences in the Olfactory Systems 

Sexual dimorphisms have been found in the olfactory sys- 
tems of mouse and fly. In mice, sexual dimorphisms were 
found in AOS, where pheromones are perceived by VNRs 
(Zhang et al. 2004; Clowney et al. 2011; Magklara et al. 
2011). In flies, olfaction is mediated by OSNs in sensilla 
of the third antennal segment and the maxillary palps. Each 
OSN usually expresses one (sometimes two) of the 62 Olfr 
genes, and OBPs are secreted in the perilymph by support 
cells in the sensilla. Unlike vertebrates, ORs and OBPs in flies 
are responsible for sensing both odorants and pheromones. 
Although Zhou et al. (Kurtovic et al. 2007; Wyart et al. 2007; 
Haga et al. 2010) showed sexual dimorphic expression pat- 
terns of Obp genes in flies, the dimorphic patterns varied 
among genes. That is, some of the Obp genes were ex- 
pressed at a higher level in males and some were expressed 
at a higher level in females. In contrast, in our study all of the 
Obp genes and the other Lcn genes showed a consistently 
higher expression level in female mice at 4 weeks of age, 
whereas Olfr genes tend to have a higher expression level 
in males. The high expression of Obp genes in both sexes 
indicates an important function of OBP in OE. However, it 
requires experiments to test whether the differential expres- 
sion in Obp genes lead to differences in the ability of smell- 
ing odorant between the two sexes. These observations 
raise an interesting question as to whether sexual dimor- 
phism exists in other inbred lab strains or in the wild house 
mice, and what the advantage of evolving dimorphic expres- 
sion patterns in MOS is. Further research is required to ad- 
dress these questions. 

Rapid Evolution of Obp Genes 

We found that several duplication events have produced 
multiple copies of Obp genes in the X-lined Obp gene cluster 
in the house mouse, increasing the potential in transporting 
more kinds of odorants. We noted that these genes have 
evolved rapidly, similar to the rapid evolution of Lcn genes 
observed in other species (Zhou et al. 2009). 

Several models have been proposed for the evolution of 
duplicate genes (Stopkova et al. 2010; Meslin et al. 201 1). 
One model proposes that duplicate genes tend to reduce 
their expression levels in order to achieve dosage balances, 
that is, the same amount of proteins would be produced 
before and after the duplication (Lynch and Force 2000; 
Shiao et al. 2008; Kaessmann et al. 2009), especially for 
genes involved in the protein complexes (Qian et al. 
2010). In mouse, OBP1 is a heterodimer protein formed 
by two subunits, OBP1a and OBP1b. We found that each 
subunit has a duplicate copy in the mouse genome and both 
are expressed. According to the above model, the duplicate 
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genes, Gm5938 (Obp1a-dup) and Gm 14743 {Obp1 b-dup), 
and the parental genes, Obpla and Obplb, should all re- 
duce their expression levels to rebalance the dosage of 
OBP1 . However, Obpla, Obplb, and Obp1 b-dup were ex- 
pressed at high levels, whereas Obp 1a-dup was expressed at 
an extremely low level. It is possible that Obp1a-dup is un- 
dergoing pseudogenization. For this possibility, we note the 
great sequence divergence between Obp1a-dup and 
Obpla. 

A pseudogene, Gm14750 (ENSMUSG00000082635), in 
the X-linked Obp gene cluster was highly expressed in OE 
(male FPKM = 4,463, female FPKM = 12,172). The pseu- 
dogene resides between Obp2 and 5430402E10Rik (Obp2- 
duplicate) on the X chromosome and was determined as 
a pseudogene because of a prematured stop codon. Inter- 
estingly, even though not sharing sequence homology with 
the Obp genes in the cluster, Gm14750 showed a female- 
biased expression pattern as well (fig. 2). Although pseudo- 
genes are not functional in most cases, it has been proposed 
that they may play a role in regulating their paralogous part- 
ners (Papp et al. 2003; Yang et al. 2003; Chang and Liao 
2011) and a role in pathological defense in bacteria (Lim 
et al. 2004). The expression of Gm14750 indicates that 
all of the genes in the X-linked Obp gene cluster share 
the same regulation mechanism. 

A Potential Role of MOS in Pheromone Perception 

We found that the expression of Mup4 and Mup5 is not 
strain, age, or sex specific in OE. MUPs were found in urine 
and a number of secretary glands, such as lachrymal, mam- 
mary, and salivary glands (Zhou et al. 2009). In liver, MUPs 
are produced and secreted into serum, which is followed by 
a rapid excretion into urine. It was proposed that male urine 
has about 3 times more MUPs than female urine (Shaw et al. 
1983; Shahan et al. 1987; Stopka et al. 2007). One of the 
function of MUPs is to bind to low molecular weight volatile 
pheromones and affect their transport and release of pher- 
omones in VNO (Stopka et al. 2007). Another function is 
that they act as pheromones and regulate sex-specific social 
behaviors (Beynon and Hurst 2003). Furthermore, MUPs 
were only found in the urine of matured individuals. The ex- 
pression of Mup4 and Mup5 has been identified in the nasal 
tissue in mouse (Roberts et al. 201 0), and we found them to 
be expressed in the OE of juvenile and adult mice. The abun- 
dant expression of Mup4 and Mup5 based on our data in- 
dicate that the two genes may play an important role in 
MOS. 

One possible explanation of the expression of two Mups 
in OE is that these two proteins bind to odorant molecules in 
nasal and act as OBPs, as suggested by Utsumi et al. (1 999). 
We compared the protein sequences of MUP4 and MUP5 
with other OBPs in mouse and found that the protein se- 
quences of MUPs have highly diverged from OBPs (fig. 4). 



Thus, MUP4 and MUP5 are very likely to have been function- 
ally differentiated from the OBPs. Furthermore, the activa- 
tion of accessory olfactory bulbs was observed when OE was 
stimulated by certain odorants, and it was abolished in re- 
sponse to urine when OE underwent a lesion treatment 
(Utsumi et al. 1999). The above evidence indicates a role 
of MOS in pheromone detection. 

Supplementary Material 

Supplementary figures S1-S4 and tables S1-S6 are available 
at Genome Biology and Evolution online (http://www. 
gbe.oxfordjournals.org/). 

Acknowledgments 

Thanks to Jiun-Jie Chen, Chi-Wei Chang, and Yi-Wei Huang 
for providing experimental assistance that has contributed 
to the data reported in this paper. Dr Michael Baum at Bos- 
ton University provided inspiring discussions for the study. 
This work was supported by Academia Sinica and National 
Science Council (NSC) of Taiwan (NSC 99-2628-B-001 -009- 
MY3, NSC 99-231 1 -B-400-003-MY2, NSC 1 00-231 9-B- 
400-001, and NSC 98-231 1-B-492-001-MY3). 

Literature Cited 

Al-Shahrour F, Diaz-Uriarte R, Dopazo J. 2004. FatiGO: a web tool for 

finding significant associations of Gene Ontology terms with groups 

of genes. Bioinformatics 20:578-580. 
Beynon RJ, Hurst JL. 2003. Multiple roles of major urinary proteins in the 

house mouse, Mus domesticus. Biochem Soc Trans. 31:142-146. 
Bocskei Z, et al. 1992. Pheromone binding to two rodent urinary 

proteins revealed by X-ray crystallography. Nature 360:186-188. 
Briand L, Blon F, Trotier D, Pernollet JC. 2004. Natural ligands of hamster 

aphrodisin. Chem Senses. 29:425-430. 
Briand L, Trotier D, Pernollet JC. 2004. Aphrodisin, an aphrodisiac lipocalin 

secreted in hamster vaginal secretions. Peptides 25:1545-1552. 
Briand L, et al. 2000. Ligand-binding properties and structural 

characterization of a novel rat odorant-binding protein variant. Eur 

J Biochem. 267:3079-3089. 
Chang AY, Liao BY. 2011. DNA methylation rebalances gene dosage 

after mammalian gene duplications. Mol Biol Evol. 29:133-144. 
Clowney EJ, et al. 201 1 . High-throughput mapping of the promoters of 

the mouse olfactory receptor genes reveals a new type of 

mammalian promoter and provides insight into olfactory receptor 

gene regulation. Genome Res. 21:1249-1259. 
Flower DR, North AC, Sansom CE. 2000. The lipocalin protein family: 

structural and sequence overview. Biochim Biophys Acta. 1482:9-24. 
Godfrey PA, Malnic B, Buck LB. 2004. The mouse olfactory receptor 

gene family. Proc Natl Acad Sci USA. 101:2156-2161. 
Haga S, et al. 201 0. The male mouse pheromone ESP1 enhances female 

sexual receptive behaviour through a specific vomeronasal receptor. 

Nature 466:118-122. 
Herrada G, Dulac C. 1997. A novel family of putative pheromone 

receptors in mammals with a topographically organized and sexually 

dimorphic distribution. Cell 90:763-773. 
Johnson MA, Hernandez I, Wei Y, Greenberg N. 2000. Isolation and 

characterization of mouse probasin: an androgen-regulated protein 



Genome Biol. Evol. 4(5):703-712. doi:10.1093/gbe/evs039 Advance Access publication April 17, 2012 



711 



Shiao et al. 



GBE 



specifically expressed in the differentiated prostate. Prostate 
43:255-262. 

Kaessmann H, Vinckenbosch N, Long M. 2009. RNA-based gene 
duplication: mechanistic and evolutionary insights. Nat Rev Genet. 
10:19-31. 

Kurtovic A, Widmer A, Dickson BJ. 2007. A single class of olfactory 

neurons mediates behavioural responses to a Drosophila sex 

pheromone. Nature 446:542-546. 
Li H, et al. 2009. The Sequence Alignment/Map format and SAMtools. 

Bioinformatics 25:2078-2079. 
Lim MM, et al. 2004. Enhanced partner preference in a promiscuous 

species by manipulating the expression of a single gene. Nature 

429:754-757. 

Lynch M, Force A. 2000. The probability of duplicate gene preservation 

by subfunctionalization. Genetics 154:459-473. 
Magklara A, et al. 2011. An epigenetic signature for monoallelic 

olfactory receptor expression. Cell 145:555-570. 
Meslin C, et al. 201 1 . The evolutionary history of the SAL1 gene family 

in eutherian mammals. BMC Evol Biol. 11:148. 
Papp B, Pal C, Hurst LD. 2003. Dosage sensitivity and the evolution of 

gene families in yeast. Nature 424:194-197. 
Pes D, et al. 1998. Cloning and expression of odorant-binding proteins 

la and lb from mouse nasal tissue. Gene 212:49-55. 
Qian W, Liao BY, Chang AY, Zhang J. 2010. Maintenance of duplicate 

genes and their functional redundancy by reduced expression. 

Trends Genet. 26:425-430. 
Roberts SA, et al. 2010. Darcin: a male pheromone that stimulates 

female memory and sexual attraction to an individual male's odour. 

BMC Biol. 8:75. 

Rubenstein DR, Lovette IJ. 2009. Reproductive skew and selection on 
female ornamentation in social species. Nature 462:786-789. 

Serizawa S, Miyamichi K, Sakano H. 2004. One neuron-one receptor 
rule in the mouse olfactory system. Trends Genet. 20:648-653. 

Shahan K, Gilmartin M, Derman E. 1987. Nucleotide sequences of 
liver, lachrymal, and submaxillary gland mouse major urinary 
protein mRNAs: mosaic structure and construction of panels of 
gene-specific synthetic oligonucleotide probes. Mol Cell Biol. 
7:1938-1946. 



Shaw PH, Held WA, Hastie ND. 1 983. The gene family for major urinary 
proteins: expression in several secretory tissues of the mouse. Cell 
32:755-761. 

Shiao MS, Liao BY, Long M, Yu HT. 2008. Adaptive evolution of the 

insulin two-gene system in mouse. Genetics 178:1683-1691. 
Stopka P, Janotova K, Heyrovsky D. 2007. The advertisement role of 

major urinary proteins in mice. Physiol Behav. 91:667-670. 
Stopkova R, et al. 201 0. Novel OBP genes similar to hamster Aphrodisin 

in the bank vole, Myodes glareolus. BMC Genomics 11:45. 
Tamura K, et al. 201 1 . MEGA5: molecular evolutionary genetics analysis 

using maximum likelihood, evolutionary distance, and maximum 

parsimony methods. Mol Biol Evol. 28:2731-2739. 
Tegoni M, et al. 2000. Mammalian odorant binding proteins. Biochim 

Biophys Acta. 1482:229-240. 
Touhara K. 2007. Molecular biology of peptide pheromone production 

and reception in mice. Adv Genet. 59:147-171. 
Toung JM, Morley M, Li M, Cheung VG. 201 1 . RNA-sequence analysis of 

human B-cells. Genome Res. 21:991-998. 
Trapnell C, Pachter L, Salzberg SL. 2009. TopHat: discovering splice 

junctions with RNA-Seq. Bioinformatics 25:11 05-1 111. 
Trapnell C, et al. 2010. Transcript assembly and quantification by RNA- 
Seq reveals unannotated transcripts and isoform switching during 

cell differentiation. Nat Biotechnol. 28:511-515. 
Utsumi M, et al. 1999. Expression of major urinary protein genes in the 

nasal glands associated with general olfaction. J Neurobiol. 39:227-236. 
Wyart C, et al. 2007. Smelling a single component of male sweat alters 

levels of Cortisol in women. J Neurosci. 27:1261-1265. 
Xiong Y, et al. 2010. RNA sequencing shows no dosage compensation 

of the active X-chromosome. Nat Genet. 42:1043-1047. 
Yang J, Lusk R, Li WH. 2003. Organismal complexity, protein complexity, 

and gene duplicability. Proc Natl Acad Sci USA. 100:15661-15665. 
Zhang X, et al. 2004. High-throughput microarray detection of olfactory 

receptor gene expression in the mouse. Proc Natl Acad Sci USA. 

101:14168-14173. 
Zhou S, Stone EA, Mackay TF, Anholt RR. 2009. Plasticity of the 

chemoreceptor repertoire in Drosophila melanogaster. PLoS Genet. 

5:e1000681. 

Associate editor: Takashi Gojobori 



712 Genome Biol. Evol. 4(5):703-712. doi:10.1093/gbe/evs039 Advance Access publication April 17, 2012 



